Mixed-domain coding of speech at 3 kb/s
نویسندگان
چکیده
We present a speech coding algorithm called mixeddomain residual coding (MDRC) wherein a prototype pitch cycle in each frame of the speech residual is coded in the time-domain while interpolation of the residual signal is performed in the frequency-domain. A novel quantization scheme takes into account time scaling and di erentially codes successive prototypes with a closed-loop perceptually-weighted search. A xed-rate (3.15 kb/s) implementation of MDRC achieves quality better or comparable to higher rate coders such as FS 1016 CELP and IMBE.
منابع مشابه
A mixed sinusoidally excited linear prediction coder at 4 kb/s and below
There is currently a great deal of interest in the development of speech coding algorithms capable of delivering toll quality at 4 kb/s and below. For synthesizing high quality speech, accurate representation of the voiced portions of speech is essential. For bit rates of 4 kb/s and below, conventional Code Excited Linear Prediction (CELP) may likely not provide the appropriate degree of period...
متن کاملHigh quality MELP coding at bit-rates around 4 kb/s
Recently, a number of coding techniques have been reported to achieve near toll quality synthesized speech at bit-rates around 4 kb/s. These include variants of Code Excited Linear Prediction (CELP), Sinusoidal Transform Coding (STC) and Multi-Band Excitation (MBE). While CELP has been an effective technique for bit-rates above 6 kb/s, STC, MBE, Waveform Interpolation (WI) and Mixed Excitation ...
متن کاملAnalysis-by-synthesis multimode harmonic speech coding at 4 kb/s
This paper presents a 4 kb/s Analysis-by-Synthesis Multimode Harmonic Coder (AbS-MHC). Novel features of this coder include a signal modification technique that allows time-domain analysisby-synthesis parameter estimation in sinusoidal coding framework, and a frequency-domain transition speech model with improved parameter estimation and quantization schemes. An efficient quantization scheme fo...
متن کاملA 1.7 kb/s MELP coder with improved analysis and quantization
This paper describes our new Mixed Excitation Linear Predictive (MELP) coder designed for very low bit rate applications. This new coder, through algorithmic improvements and enhanced quantization techniques, produces better speech quality at 1.7 kb/s than the new U.S. Federal Standard MELP coder at 2.4 kb/s. Key features of the coder are an improved pitch estimation algorithm and a Line Spectr...
متن کاملStrategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec - Vision, Image and Signal Processing, IEE Proceedings-
This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser techniq...
متن کامل